Core-Tag Clustering for Web 2.0 Based on Multi-similarity Measurements
نویسندگان
چکیده
Along with the development of Web2.0, folksonomy has become a hot topic related to data mining, information retrieval and social network. The tag semantic is the key for deep understanding the correlation of objects in folksonomy. This paper proposes two methods to cluster tags for core-tag by fusing multi-similarity measurements. The contributions of this paper include: (1) Proposing the concept of core-tag and the model of core-tag clusters. (2) Designing a core-tag clustering algorithm CETClustering, based on clustering ensemble method. (3) Designing a second kind of core-tag clustering algorithm named SkyTagClustering, based on skyline operator. (4) Comparing the two algorithms with modified K-means. Experiments show that the two algorithms are better than modified K-means with 20-30% on efficiency and 20% higher scores on quality. Keyword: folksonomy, tag, clustering, clustering ensemble, skyline.
منابع مشابه
A Personalized Tag-Based Recommendation in Social Web Systems
Tagging activity has been recently identified as a potential source of knowledge about personal interests, preferences, goals, and other attributes known from user models. Tags themselves can be therefore used for finding personalized recommendations of items. In this paper, we present a tag-based recommender system which suggests similar Web pages based on the similarity of their tags from a W...
متن کاملWeb Clustering Based On Tag Set Similarity
Tagging is a service that allows users to associate a set of freely determined tags with web content. Clustering web documents with tag sets can eliminate the time-consuming preprocess of word stemming. This paper proposes a novel method to compute the similarity between tag sets and use it as the distance measure to cluster web documents into groups. Major steps in this method include computin...
متن کاملTag Clustering with Self Organizing Maps
© Tag Clustering with Self Organizing Maps Marco Luca Sbodio, Edwin Simpson HP Laboratories HPL-2009-338 SOM, clustering, machine learning, folksonomy, tagging, web 2.0 Today, user-generated tags are a common way of navigating and organizing collections of resources. However, their value is limited by a lack of explicit semantics and differing use of tags between users. Clustering techniques th...
متن کاملAn Empirical Comparison of Distance Measures for Multivariate Time Series Clustering
Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...
متن کاملSimSpectrum: A Similarity Based Spectral Clustering Approach to Generate a Tag Cloud
Tag clouds are means for navigation and exploration of information resources on the web provided by social Web sites. The most used approach to generate a tag cloud so far is based on popularity of tags among users who annotate by those tags. This approach however has several limitations, such as suppressing number of tags which are not used often but could lead to interesting resources as well...
متن کامل